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CAGCGTCAGACGCAGGGCACTGAGAATGTGCGACAGCGCGCAACGATGAAGTAGCCCAGAGGGTCCCTTG 

GAAAATGAGGCCAGGGTCCCTGCTGCTGCTTGTTCTGCTGCTCGCCCTGTCCAGGAGCCTGCGGGGCAAA 

GAGTGTGCGTCTCCACCCTGTGAGTGTCACCAGGAGGACGACTTCAGAGTCACCTGCAAGGAGCTCCACC 

GAATCCCCAGCCTGCCGCCCAGCACCCAGACTCTGAAGCTCATCGAGACTCATCTGAAGACCATACCCAG 

TCTTGCATTTTCGAGTCTGCCCAATATTTCCAGGATCTATTTATCTATAGATGCAACTCTGCAGCGGCTG 

GAACCACATTCTTTCTACAATTTGAGTAAAATGACTCACATAGAAATCCGGAACACCAGAAGCTTAACCT 

ATATAGACCCTGATGCCTTGACAGAGCTCCCCTTGCTCAAGTTTCTTGGCATTTTCAATACTGGACTTAG 

AATATTCCCTGACTTGACCAAAATTTATTCCACGGACATATTCTTTATACTTGAAATCACAGACAACCCT 

TACATGACTTCGGTCCCTGAAAACGCATTCCAGGGCCTATGCAATGAAACCTTGACCCTGAAACTGTACA 

ACAATGGATTTACTTCAGTCCAAGGACATGCTTTCAATGGAACAAAGCTGGATGCTGTTTACCTAAACAA 

GAATAAATACCTGACAGCTATAGACAACGATGCCTTTGGAGGAGTATACAGTGGACCAACTTTGCTAGAT 

GTGTCTTCCACCAGCGTCACTGCCCTTCCTTCCAAAGGCCTGGAGCACCTCAAAGAACTGATCGCAAAAG 

ACACCTGGACTCTCAAAAAGCTCCCGCTGTCGTTGAGTTTCCTCCACCTCACTCGGGCTGACCTCTCTTA 

CCCGAGCCACTGCTGCGCTTTTAAGAACCAGAAGAAAATCAGGGGAATCCTGGAGTCTTTGATGTGTAAT 

GAGAGCAGTATCCGGAACCTTCGTCAAAGGAAATCAGTGAACATCTTGAGGGGTCCCATCTACCAGGAAT 

ATGAAGAAGATCCGGGTGACAACAGTGTTGGGTACAAACAAAACTCCAAGTTCCAGGAGAGCCCAAGCAA 

CTCTCACTATTACGTCTTCTTTGAAGAACAAGAGGATGAGGTCGTTGGTTTCGGCCAAGAGCTCAAAAAT 

CCTCAGGAAGAGACTCTCCAAGCCTTCGAGAGCCACTATGACTACACGGTGTGTGGGGACAACGAGGACA 

TGGTGTGTACCCCCAAGTCGGACGAGTTTAACCCCTGTGAAGATATCATGGGCTACAGGTTCCTGAGAAT 

CGTGGTGTGGTTTGTCAGTCTGCTGGCTCTCCTGGGCAATATCTTCGTCCTGCTCATTCTGCTAACCAGC 

C AC TAC AAATTGACCGTGC C GC GGTTC C TC ATGTGC AAC TTGGC C TTTGC AG ATTTC TGC ATGGGGGT AT 
ACCTGCTTCTCATTGCCTCTGTAGACCTGTACACACACTCTGAGTACTACAACCACGCCATCGACTGGCA 

GACGGGCCCTGGGTGCAACACGGCTGGCTTCTTCACTGTTTTCGCCAGTGAGTTATCAGTGTACACACTG 
ACGGTCATCACCCTGGAGCGATGGTACGCCATCACCTTCGCCATGCGCCTGGATAGGAAGATCCGCCTCA 
GGCACGCGTACACCATCATGGCTGGGGGCTGGGTTTCCTGCTTCCTTCTCGCCCTGCTCCCGATGGTGGG 
AATCAGCAGCTATGCCAAGGTCAGCATCTGCCTGCCAATGGACACCGACACCCCTCTTGCACTCGCATAC 
ATTGTCCTCGTTCTGCTGCTCAATGTTGTTGCCTTTGTTGTCGTCTGTTCCTGCTATGTGAAGATCTACA 
TCACGGTCCGAAATCCCCAGTACAACCCTCGAGATAAAGACACCAAGATTGCCAAGAGGATGGCTGTGTT 
GATCTTC AC TGACTTC ATGTGC ATGGCGCCCATCTCCTTCTATGCGCTGTCGGCACTTATGAACAAGCCT 
CTAATCACTGTTACTAACTCCAAAATCTTGTTGGTTCTCTTCTACCCCCTCAACTCCTGTGCCAATCCGT 
TTCTCTATGCTATTTTCACCAAGGCCTTCCAGAGGGACGTGTTCATCCTGCTCAGCAAGTTTGGCATCTG 
CAAACGCCAGGCCCAGGCCTATCAGGGTCAGAGAGTCTGTCCCAACAATAGCACTGGTATTCAGATCCAA 
AAGATTCCCCAGGACACGAGGCAGAGTCTCCCCAACATGCAAGATACCTATGAACTGCTTGGAAACTCCC 
AGCTAGCTCCAAAACTGCAGGGACAAATCTCAGAAGAGTATAAGCAAACAGCCTTGTAAAGGAAAGGCTA 
CGCTAGTCACAGTGAGACTTACAAAAGGCTGGTTTCTTGAACATGCGTTCCAGTCCCGTGACATGTGAAC 
AC ATAGGTTC ATGC AGGTGATGATTC AT AGGGTC AGAGTTC ATC TC T AG AAAGTATTGC CTC 
(SEQ ID N0:1) 
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MRPGSLLLLVLLLALSRSLRGKECASPPCECHQEDDFRVTCKELHRIPSLPPSTQTLKLIETHLKTIPSLAFSSLPN 
ISRIYLSIDATLQRLEPHSFYNLSKMTHIEIRNTRSLTYIDPDALTELPLLKFLGIFNTGLRIFPDLTKIYSTDIFF 
ILEITDNPYMTSVPENAFQGLCNETLTLKLYNNGFTSVQGHAFNGTKLDAVYLNKNKYLTAIDNDAFGGVYSGPTLL 
DVSSTSVTALPSKGLEHLKELIAKDTWTLKKLPLSLSFLHLTRADLSYPSHCCAFKNQKKIRGILESLMCNESSIRN 
LRQRKSVNILRGPIYQEYEEDPGDNSVGYKQNSKFQESPSNSHYYVFFEEQEDEWGFGQELKNPQEETLQAFESHY 
DYTVCGDNEDIWCTPKSDEFNPCEDIMGYRFLRIVVWFVSLLALLGNIFVLLILLTSHYKLTVPRFLMCNLAFADFC 
MGVYLLL I AS VDL YTH S E Y YNHA I DWQTG PGCNT AGF FTVF AS EL SVYTLTVI TLERWYAI TF AMRLDRKI RLRHAY 
TIMAGGWSCFLLALLPWGISSYAKVSICLPMDTDTPLALAYIVLVLLLNWAFVWCSCYVKIYITVRNPQYNPR 
DKDTKIAKRMAVLIFTDFMCMAPISFYALSALMNKPLITVTNSKILLVLFYPLNSCANPFLYAIFTKAFQRDVFILL 
SKFGICKRQAQAYQGQRVCPNNSTGIQIQKIPQDTRQSLPNMQDTYELLGNSQLAPKLQGQISEEYKQTAL 

(SEQ ID NO: 2) 
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underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 

[ C AGC GTC AG AC GC AGGGC AC T G AG AATGTGC G AC AGC GC GC AAC G ATG AAGT AGC C C AG 
AGGGTCCCTTGGAAAATGAGGCC AGGGTCCC ] TGCTGCTGCT TGTTCTGCTGCTCGCCCT 
GTCCAGGAGCCTGCGGGGCAAAGAGTGTGrCTCTCCACCCTGTGA [ GTGTC ACC AGGAGG 
ACGACTTCAGAGTCACCTGCAAGGAGCTCCACCGAATCCCCAGCCTGCCGCCCAGCACCC 
AGACTCT ] GAAGCTCATCGAGACTC ATCTGAAGACCATACCC AGTCTTGC ATTTTCGAGT 
CTGCCCAATATTTCCAGGATCTATTTATCTATAGATGCAACTCTGCAGCGGCTGGAACCA 
CATTCTTTCTACAATTTGAGTAAAATGACTCACATAGAAATCCGGAACACCAGAAGCTTA 
ACCTATATAGACCCTGATGCCTTGACAGAGCTCCCCTTGCTCAAGTTTCTTGGCATTTTC 
AATACTGGACTTAGAATATTCCCTGACTTGACCAAAATTTATTCCACGGACATATTCTTT 
ATACTTGAAATCACAGACAACCCTTACATGACTTCGGTCCCTGAAAACGCATTCCAGGGC 

C T ATGC AATG AAAC C TTG AC C C TG AAAC TGT AC AAC AATGG ATTT AC TTC AGTC C AAGG A 
CATGCTTTCAATGGAACAAAGCTGGATGCTGTTTACCTAAACAAGAATAAATACCTGACA 
GCTATAGACAACGATGCCTTTGGAGGAGTATACAGTGGACCAACTTTGCTAGATGTGTCT 
TCCACCAGCGTCACTGCCCTTCCTTCCAAAGGCCTGGAGCACCTCAAAGAACTGATCGCA 
AAAGACACCTGGACTCTCAAAAAGCTCCCGCTGTCGTTGAGTTTCCTCCACCTCACTCGG 

GC TG AC C TC TC TT AC C C G AGC C AC TGC TGC GC TTTT AAG AAC C AG AAG AAAATC AGGGG A 
ATCCTGGAGTCTTTGATGTGTAATGAGAGCAGTATCCGGAACCTTCGTCAAAGGAAATCA 
GTGAACATCTTGAGGGGTCCCATCTACCAGGAATATGAAGAAGATCCGGGTGACAACAGT 
GTTGGGTACAAACAAAACTCCAAGTTCCAGGAGAGCCCAAGCAACTCTCACTATTACGTC 
TTCTTTGAAGAACAAGAGGATGAGGTCGTTGGTTTCGGCCAAGAGCTCAAAAATCCTCAG 
GAAGAGACTCTCCAAGCCTTCGAGAGCCACTATGACTACACGGTGTGTGGGGACAACGAG 
GACATGGTGTGTACCCCCAAGTCGGACGAGTTTAACCCCTGTGAAGATATCATGGGCTAC 
AGGTTCCTGAGAATCGTGGTGTGGTTTGTCAGTCTGCTGGCTCTCCTGGGCAATATCTTC 
GTCCTGCTCATTCTGCTAACCAGCCACTACAAATTGACCGTGCCGCGGTTCCTCATGTGC 
AACTTGGCCTTTGCAGATTTCTGCATGGGGGTATACCTGCTTCTCATTGCCTCTGTAGAC 
CTGTACACACACTCTGAGTACTACAACCACGCCATCGACTGGCAGACGGGCCCTGGGTGC 
AACACGGCTGGCTTCTTCACTGTTTTCGCCAGTGAGTTATCAGTGTACACACTGACGGTC 
ATCACCCTGGAGCGATGGTACGCCATCACCTTCGCCATGCGCCTGGATAGGAAGATCCGC 
CTCAGGCACGCGTACACCATCATGGCTGGGGGCTGGGTTTCCTGCTTCCTTCTCGCCCTG 
C TC C C GATGGTGGGAATC AGC AGC T ATGC CAAGGTC AGC ATC TGC C TGC CAATGGAC ACC 
GACACCCCTCTTGCACTCGCATACATTGTCCTCGTTCTGCTGCTCAATGTTGTTGCCTTT 

GTTGTC GTC TGTTC CTGC TATGTGAAG ATC TAC ATC ACGGTC C GAAATC C C C AGTAC AAC 
CCTCGAGATAAAGACACCAAGATTGCCAAGAGGATGGCTGTGTTGATCTTCACTGACTTC 
ATGTGCATGGCGCCCATCTCCTTCTATGCGCTGTCGGCACTTATGAACAAGCCTCTAATC 
ACTGTTACTAACTCC AAAATC TTGTTGGTTCTCTTCTACCCCCTC AAC TCCTGTGCCAAT 
CCGTTTCTCTATGCTATTTTCACCAAGGCCTTCCAGAGGGACGTGTTCATCCTGCTCAGC 
AAGTTTGGCATCTGCAAACGCCAGGCCCAGGCCTATCAGGGTCAGAGAGTCTGTCCCAAC 
AATAGCACTGGTATTCAGATCCAAAAGATTCCCCAGGACACGAGGCAGAGTCTCCCCAAC 
ATGCAAGATACCTATGAACTGCTTGGAAACTCCCAGCTAGCTCCAAAACTGCAGGGACAA 
ATCTCAGAAGAGTATAAGCAAACAGCCTTGTAAAGGAAAGGCTACGCTAGTCACAGTGAG 
ACTTACAAAAGGCTGGTTTCTTGAACATGCGTTCCAGTCCCGTGACATGTGAACACATAG 
GTTCATGCAGGTGATGATTCATAGGGTCAGAGTTCATCTCTAGAAAGTATTGCCTC 
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Gene Sequence Structure 



91 bp 



Sequence Deleted 



163 bp 



Size of full-length 
cDNA: 2512 bp 




Targeting Vector* 
(genomic sequence) 



Arm Length: 
5': 5 kb 
3': 1.3 kb 



^— — Targeting Vector 
_ _ _ _ _ Endogenous Locus 

* Not drawn to scale 



5* arm 



LacZ-Neo 

Cassette 




3* arm 



y probe 



5 ■ > AC TTGAGAGC C TC TC C TTC C C 
CCTCTCCAGCGTGCTCTCCAGCGA 
TGAGGTCACAGCCCCTCGGAGCCC 
TCCTCCTCCCTCCCTTCCCCTCCT 
GC AC C C GGGTC TC TTC C AGCGTC A 
GACGCAGGGCACTGAGAATGTGGC 
GACAGCGCGCAACGATGAAGTAGC 
CCAGAGGGTCCCTTGGAAAATGAG 
GCCAGGGTCCCO' 
(SEQ ID NO: 3) 



5 1 >GTGTCACCAGGAGGACGACTT 
CAGAGTCACCTGCAAGGAGCTCCA 
CCGAATCCCCAGCCTGCCGCCCAG 
CACCCAGACTCTGTGAGTAGCCAA 
GGCCAAGACCCCCCCCCCCCGAGA 
AATTCGTGGTGTGTGTTGGGGTGT 
GCGCGGATATCTGGTCAGTCCCTG 
TACAAATTCAATCCCCCATGCTCG 
GGAAGGTCAGC<3 1 
(SEQ ID NO: 4) 
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